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ABSTRACT 


This  paper  describes  the  formation  of  a  dictionary  list  structure 
which  can  be  used  by  an  English  language  translator  to  enable  natural 
language  man-machine  conversation  directed  towards  the  retrieval  of 
information  from  a  data  bank.  The  hierarchical  structure  of  the  let- 
ters in  a  word  and  the  placement  of  word  attributes  in  this  structure 
is  discussed. 

A  computer  program,  which  accepts  as  input  an  English  language 
sentence  and  processes  this  sentence  in  conjunction  with  the  dictionary 
list  structure  to  obtain  the  attributes  of  the  individual  words,  is 
described.   The  incorporation  of  this  dictionary  structure  into  a 
complete  natural  language  information  retrieval  system  is  also 
discussed. 
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LIST  OF  DEFINITIONS 


1.  Record  (Logical  Record)  -  A  collection  of  related  data  elements 
treated  as  a  unit.   (E.g.,  the  topic  name  and  its  numeric  code  word) < 

2o   File  -  A  collection  of  related  records  treated  as  a  unit. 

3-   Information  Retrieval  System  -  A  process  developed  to  recover 
specific  information  from  a  data  bank. 

A.  Key-Searches  -  Searches  conducted  to  retrieve  information  in 
accordance  with  a  specific  data  input  parameter  (or  set  of  parameters) 
called  a  key. 

5o  Pointer  -  A  numeric  value  used  to  specify  an  address  in  core 

6.  Null  Pointer  -  A  special  pointer  value  that  cannot  relate  to  any 
address  in  storage  (represented  by  NIL  orX). 

7o  Node  -  One  or  more  consecutive  bytes  of  core  storage  divided  into 
parts  called  fields.   (synonym  —  cell) 

8.  Tree  -  A  finite  set  of  one  or  more  nodes  such  that  there  is  one 
specially  designated  node  called  the  root  of  the  tree  and  the  remaining 
nodes  are  partitioned  into  disjoint  sets  of  trees  which  are  called 
subtrees  of  the  original  tree. 

9.  Binary  Tree  -  A  tree  structure  where  each  node  has  at  most  two 
subtrees.  When  there  is  only  one  subtree  present,  a  distinction  is 
made  between  whether  it  is  a  left  subtree  or  a  right  subtree. 

10.  Hash  Coding  -  A  method  of  effecting  random  storage  where  a  trans^ 
formation  is  performed  on  a  key  (in  this  case  the  topic  name)  to 
produce  an  address  in  the  table  which  can  be  used  to  locate  a  position 
for  placing  the  key  and  its  associated  item. 


I.   INTRODUCTION 

The  Technical  Reports  Section  at  the  Naval  Postgraduate  School 
Library  currently  offers  its  users  access  to  those  reports  in  its 
holdings  by  either  of  two  methods.   First,  the  user  can  search  through 
the  card  index  to  obtain  the  report  number  of  the  item  desired,  or 
second,  he  can  conduct  a  computerized  library  search  to  obtain  a  bib- 
liography of  those  reports  in  the  holdings  which  pertain  to  the  sub- 
ject area  of  interest. 

This  library  search  is  conducted  by  entering  a  numeric  code  word 
into  an  information  retrieval  system  which  will  then  use  this  code 
word  as  a  key  to  search  for  and  retrieve  a  bibliographical  listing 
of  those  reports  which  pertain  to  the  entered  code  word.   Any  number 
of  code  words  can  be  combined,  and  the  items  on  the  bibliography  re- 
trieved for  the  user  will  have  all  of  the  code  words  within  their  list 
of  keys.  At  present  the  Technical  Reports  Section  has  about  100,000 
bibliographical  items  in  its  file,  and  these  items  can  be  accessed 
by  over  10,000  different  code  words. 

Although  a  library  search  is  faster  and  easier  for  the  user,  es- 
pecially when  only  a  general  topic  area  is  desired,  it  does  have  a 
number  of  shortcomings.   First,  there  is  a  nominal  delay  between  the 
request  for  the  search  and  the  obtaining  of  the  bibliography  resulting 
from  that  search.   Second,  the  user  does  not  know  at  the  time  of  his 
request  whether  or  not  his  topic  area  is  too  broad  or  too  narrow  to 
obtain  a  reasonable  bibliography.   Third,  the  user  presently  has  to 
personally  search  through  a  book  of  topic  areas  to  obtain  the  numberic 
code  words  required  for  the  information  retrieval  system. 


This  report  is  the  result  of  a  project  to  develop  a  flexible  and 
accessible  information  system  for  the  specific  objective  of  retrieving 
a  bibliography  of  the  reports  held  by  the  Technical  Reports  Section 
of  the  Naval  Postgraduate  School  Library.   The  proposed  system  would 
be  implemented  on  the  school's  IBM  360/67  computer  and  would  utilize 
the  time-sharing  capability  of  that  system. 

The  system  as  conceived  would  be  similar  to  the  REAL  ENGLISH  system 
at  the  University  of  Pennsylvania  [  1  ] .   It  would  consist  of  an 
English  language  translation  capability  which  would  be  used  to  pre- 
process  the  user's  English  language  request,  which  has  been  entered 
at  the  time-sharing  terminal,  to  determine  the  specific  item(s)  desired. 
Then  the  numeric  code  associated  with  each  item  would  be  retrieved  from 
a  table  and  entered  into  the  existing  information  retrieval  system. 
The  number. of  items  in  the  bibliographical  file  which  met  the  criterion 
of  the  code  word(s)  would  be  printed  out  at  the  terminal,  and  the  user 
could  then  specify  whether  or  not  there  was  a  sufficient  number  of 
bibliographical  items  for  his  use.   If  the  number  of  items  were  not 
sufficient  or  overly  abundant,  the  user  could  re-initiate  his  request 
in  either  a  broader  topic  area  or  a  more  restricted  area  to  obtain 
the  desired  number  of  items  in  the  bibliography.   If  the  number  of 
bibliographical  items  was  sufficient,  the  user  could  specify  that  the 
bibliography  was  to  be  printed  out  at  the  terminal  or  at  the  offline 
printer  at  the  computer  installation.  A  generalized  block  diagram 
of  the  proposed  system  is  shown  in  figure  1,  page  11. 

The  remainder  of  this  report  has  been  written  in  two  parts.  The 
first  part  describes  the  formation  of  a  dictionary  structure  required 
for  the  English  language  translation,  the  processing  of  the  user's 
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FIGURE  1.   System  Block  Diagram 
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request  to  obtain  from  the  dictionary  the  attributes  of  the  individual 
words,  and  the  retrieval  of  the  numeric  codes  from  the  table  of  code 
words.   The  second  part  consists  of  illustrative  examples  of  the 
system. 

The  syntactic  and  semantic  interpretation  of  the  attributes  of 
the  request  and  the  actual  implementation  with  the  existing  informa- 
tion retrieval  system  are  left  for  further  study.   Some  work  in  the 
area  of  syntactic  and  semantic  interpretation  has  been  done  by  Cautin 
[1,2]  and  Kellog  [5,6]  which  might  be  of  value  for  anyone  who  desires 
to  conduct  further  work  on  this  system. 


12 


II.   SYSTEM  DESIGN 

At  this  stage  in  the  development  of  the  system,  there  are  three 
main  computer  programs  or  subroutines.   The  primary  program,  LIB-SER, 
which  is  mainly  concerned  with  performing  operations  on  lists,  was 
written  in  PL/I  [7,8,9]  to  take  advantage  of  the  list  processing  and 
recursive  capabilities  of  that  language.   Also,  PL/I  was  chosen 
instead  of  a  list  processing  language,  such  as  LISP  1.5,  because  of 
the  availability  of  PL/I  at  the  Naval  Postgraduate  School  computer 
installation* 

The  TABLE  program,  which  is  also  in  PL/I,  was  written  exclusively 
for  use  as  a  calling  program  for  the  construction  and  maintenance  of 
the  numeric  codes  table.   PL/I  was  chosen  for  this  routine  mainly 
because  of  its  character  manipulation  capabilities. 

The  final  program,  HASHX,  is  a  subroutine  written  in  IBM  360/67 
Assembly  language  [10,11]  and  is  called  into  operation  by  both  the 
LIB-SER  and  TABLE  programs.  Assembly  language  was  chosen  since  it 
allows  a  closer  control  over  the  representation  and  location  of  data 
while  in  computer  core  storage.   This  close  control  is  necessary 
since  HASHX  is  used  to  move  the  core  character  representations  of  the 
topic  names  and  their  numeric  codes  from  one  position  to  another  in 
core  storage. 

A.   LIB-SER  PROGRAM 

The  LIB-SER  program  is  the  main  program  utilized  by  the  proposed 
library  search  system.   It  is  utilized  to  build  lists,  add  items  to 
lists,  delete  items  from  lists,  and  search  for  items  on  the  lists 
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maintained  during  the  operation  of  the  proposed  system.   The  lists 
of  prime  importance  are  ATOMLIST,  the  dictionary  list  (LI),  the  user's 
request  list  (L2) ,  and  the  multiple  definitions  list  (L3) . 
1.   List  Structure  in  the  Program 

A  list  structure  was  used  in  this  program  since  a  hierarchical 
structure  was  desired  for  the  dictionary  list.   This  hierarchical 
structure  is  used  to  simplify  the  search  through  the  dictionary  for 
the  attributes  of  the  words  in  the  user's  request.   The  inputs  to  the 
program,  the  dictionary  file  and  the  user's  request,  are  built  into 
a  system  of  connected  lists  where  these  lists  take  on  a  form  similar 
to  the  program  and  da.a  structure  utilized  in  the  list  processing 
language  LISP  1.5  [12]. 

In  LISP  1.5,  program  and  data  are  represented  in  the  form  of 
symbolic  expressions  or  S-expressions  where  these  S-expressions  are 
of  indefinite  length  and  have  a  branching  binary  tree  structure  which 
allows  the  identification  and  isolation  of  sub-expressions  or  sub- 
trees within  the  basic  structure.   The  S-expressions  are  graphically 
represented  in  this  binary  tree  structure,  as  shown  in  figure  2  on 
page  15,  as  a  sequency  of  nodes  or  cells,  each  of  which  has  two  fields. 
When  using  the  dotted  pair  notation  in  LISP  1.5S  both  of  the  fields  can 
be  used  to  contain  a  pointer  to  another  node  or  cell  or  they  can  be 
used  to  contain  a  character  string.   In  an  alternative  notation,  list 
notation,  the  left  field  is  the  only  field  which  can  be  used  to  contain 
a  character  string  or  pointer.  The  right  field  contains  a  pointer 
to  the  next  node  in  the  tree  or  a  special  pointer,  NIL,  which  is  used 
as  a  terminator  of  the  list. 


14 


-(A*(B'0) 


A        V 

s 

f       NIL 

/ 

Cl_\ 

\_ 

c 

NIL 

(A(B  C)) 

(b) 
list  notation 


dotted  pair  notation 
FIGURE  2.   Graphical  Structure  of  LISP  1.5  Lists 


The  list  structure  used  in  this  program  is  a  modified  version 
of  the  list  notation  used  in  LISP  1.5.   This  modification  was  required 
by  the  use  of  PL/I  as  the  programming  language  and  the  requirement  of 
this  language  that  the  contents  of  the  fields  in  a  cell  be  explicitly 
declared.   Thus,  no  one  field  can  be  used  to  contain  both  a  pointer 
and  a  character  string  during  a  program  execution.   This  required  that 
the  character  strings  be  placed  in  a  type  of  cell,  called  ATOMCELL  in 
this  program,  defined  separately  from  the  connective  cells,  called 
CONCELL,  used  to  represent  the  tree  structure.   It  also  required,  as 
shown  in  figure  3  on  page  16,  that  each  of  the  character  strings  which 
is  placed  in  an  ATOMCELL  to  have  associated  with  it  two  CONCELLS. 
The  upper  CONCELL  is  used  to  connect  the  character  string  to  other 
character  strings  at  the  same  level  in  the  subtree.   The  lower  CONCELL 
is  used  to  contain  a  pointer  to  the  character  string  contained  in  the 
ATOMCELL  and  serve  as  an  attachment  point  for  any  subtrees  of  that 
character  string. 
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Since  there  are  two  CONCELLS  associated  with  each  character 
string,  a  method  of  determining  whether  or  not  the  left  field  of  the 
CONCELL  pointed  to  an  ATOMCELL  or  to  another  CONCELL  in  the  list  was 
required.   This  was  accomplished  by  the  addition  of  a  third  field  to 
the  CONCELL  type  of  cell.  A  fourth  field  was  added  to  indicate  the 
depth  of  the  occurrence  of  the  cell  in  the  list  structure  and  is 
utilized  during  the  construction  of  the  trees. 


00 

0 

7\ 

01 

1 

X 

01 

1 

A 

(A<BC>  ) 
FIGURE  3.   Graphical  Structure  of  the  Lists  Used.1 

a.  ATOMLIST  List 

The  ATOMLIST  list  is  used  to  store  the  alphabetic,  numeric, 
and  special  characters  which  are  required  b_  the  program  to  be  retained 
in  their  symbolic  form  during  the  operation  of  the  program.   It  con- 
sists of  a  series  of  connected  cells  called  ATOMCELLS ,  shown  in  figure 
4  on  page  17,  which  have  three  fields.   Each  new  cell  added  to  the  list 
is  connected  to  the  first  cell  on  the  list. 


All  of  the  circled  character  strings  are  included  in  the  ATOMLIST 
list  and  only  conceptually  displayed  here.   The  "?\  or   X  used  in  the 
CDRF  field  is  used  to  indicate  the  NULL  pointer. 
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The  three  fields  are  the  SIZE  field,  which  contains  the 
number  of  characters  of  the  item  represented  in  the  cell,  the  PNAME 
field,  which  contains  the  character  representation  of  the  item,  and 
the  NEXT  field,  which  contains  a  pointer  to  the  next  ATOMCELL  on  the 
list. 


SIZE 


NEXT 


PNAME 


FIGURE  4.   Structure  of  an  ATOMCELL 

Each  of  these  ATOMCELLs  is  placed  on  this  list  as  the 
characters  they  contain  are  encountered  during  the  operation  of  the 
program,  with  the  majority  of  them  being  attached  to  the  list  during 
the  initialization  phase  of  the  dictionary.   Each  item  is  included 
in  this  list  only  once  since  a  search  of  the  list  (procedure  LOOKUP) 
is  conducted  every  time  an  item  is  encountered  to  determine  if  an 
ATOMCELL  has  or  has  not  already  been  included  in  the  list  which  con- 
tains the  character(s)  of  the  item.  No  ATOMCELLs  are  deleted  from 
this  list  during  the  operation  of  the  program. 

All  referrals  to  the  character  representation  of  any  item 
on  the  ATOMLIST  is  by  the  pointer  to  the  entire  ATOMCELL.   This  pointer 
will  be  the  same  as  that  contained  in  the  NEXT  field  of  the  ATOMCELL 
immediately  prior  to  the  ATOMCELL  containing  the  desired  characters, 
if  one  exists,  or  else  it  will  be  the  pointer  ATOMLIST  since  the  charac- 
ters would  then  have  to  be  contained  in  the  first  ATOMCELL  on  the  list. 
A  diagram  of  the  ATOMLIST  is  shown  below  in  figure  5. 
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ATOMLIST 


FIGURE  5.   Diagram  of  the  ATOMLIST 

b.   Connective  Lists 

All  of  the  remaining  lists  built  and  used  by  the  program 
are  used  to  connect  the  character  representations  in  the  ATOMLIST 
in  various  fashions,   These  lists  consist  of  a  series  of  CONCELLS 
which  have  four  fields,  an  ATOMB  field,  a  LEVEL  field,  and  two  pointer 
fields,  CARF  and  CDRF.   The  structure  of  the  CONCELL  is  shown  in  figure  6, 


ATOMB 

LEVEL 

CARF 

CDRF 

FIGURE  6.   Structure  of  a  CONCELL 


The  ATOMB  field  is  used  to  indicate  the  purpose  of  the 
CONCELL.   It  is  a  two  bit  field  which  can  contain  one  of  three  binary 
values,  00 ,  01,  or  11.   The  00  value  indicates  that  the  CONCELL  is 
being  used  as  a  connective  cell  where  both  the  CARF  and  the  CDRF  fields 
contain  pointers  to  other  CONCELLs ,  with  the  exception  that  the  CDRF 
field  may  contain  a  null  pointer.   All  items  in  the  same  subtree  at 
any  particular  level  will  be  connected  by  the  CDRF  field  of  a  CONCELL 
with  a  00  ATOMB  field. 

The  01  value  indicates  that  the  CARF  field  contains  a 
pointer  to  an  ATOMCELL.   The  CDRF  field  may  contain  a  pointer  to  an- 
other CONCELL  with  a  00  ATOMB  field  which  is  at  the  next  level  down 
in  the  tree,  or  else  contain  a  null  pointer.   The  11  value,  which  can 
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only  be  placed  in  the  ATOMB  field  during  the  initialization  phase  of 
the  dictionary  list,  indicates  that  the  CARF  field  contains  a  pointer 
to  an  ATOMCELL  which  contains  the  name  of  a  multiple  definition  or  an 
attribute  name.   The  CDRF  field  always  contains  a  null  pointer. 
A  sample  connective  list  is  shown  below  in  figure  7. 


FIGURE  7.  A  Connected  List 


The  LEVEL  field  contains  a  numeric  indication  as  to  the 
level  in  the  tree  structure  of  the  particular  cell.   For  CONCELLs  that 
are  contained  in  the  multiple  definitions  list,  the  value  contained 
in  the  LEVEL  field  is  not  changed  when  that  definition  is  included 
at  other  places  in  the  structure  of  the  dictionary. 

The  use  of  the  multiple  definitions  will  be  explained 
later  during  the  discussion  of  the  initialization  of  the  dictionary. 
2.   File  Descriptions 

Two  files  are  used  by  the  program  during  the  initialization 
phase  to  enter  data  into  the  program,  the  Dictionary  file  and  the  Codes 
file.   Both  files  are  read-only  files  when  used  by  the  LIB-SER  program. 
a.  Description  of  the  Dictionary  File 
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(1)   Convention  and  Special  Characters.  With  the  excep- 
tion of  the  attribute  names  and  the  names  of  the  multiple  definitions 
sequences,  all  items  in  the  dictionary  are  single  characters.   Spaces 
or  blanks  are  used  only  to  improve  the  readability  or  to  separate  items 
in  the  dictionary.   The  attribute  names  and  the  multiple  definition 
names  must  be  at  least  two  characters  in  length  and  the  multiple  de- 
finition names  start  with  the  letter  M. 

There  are  eight  characters  used  in  this  file  which 
have  special  significance  during  the  operation  of  the  program.   These 
characters  are  the  right  and  left  parentheses,  the  greater- than  symbol 
O),  the  less-than  symbol  ((),  the  dollar  sign  ($),  the  minus  sign 
(-) ,  the  asterisk  (*) ,  and  the  slash(/) . 

The  six  characters,  right  and  left  parentheses, 
greater- than  symbol,  less-than  symbol,  the  dollar  sign,  and  the  slash, 
are  used  during  the  building  of  the  list  structure  to  determine  the 
proper  connections  which  have  to  be  made.   The  left  parenthesis  is 
used  to  indicate  the  beginning  of  the  dictionary  file.   The  right  paren- 
thesis is  used  to  indicate  the  end  of  the  dictionary  file.   The  dollar 
sign  is  used  to  terminate  the  processing  of  the  current  input.  The 
less-than  symbol  is  used  to  indicate  that  the  items  that  follow  are 
to  be  attached  to  the  preceding  character  at  the  next  lower  level  in 
the  tree  structure.   The  greater- than  symbol  is  used  to  indicate  that 
items  at  the  current  level  in  the  subtree  have  been  processed  and  that 
the  following  items  are  not  to  be  attached  at  that  level  but  at  the 
next  higher  in  the  tree  structure.   The  slash  is  used  to  indicate  that 
the  next  item  is  the  name  of  a  multiple  definition  and  that  the  pre- 
viously formed  subtree,  which  is  attached  to  that  multiple  definition 
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name,  is  to  be  inserted  at  this  point  in  the  current  subtree. 

The  two  characters,  the  minus  sign  and  the  asterisk, 
are  used  during  the  attribute  determination  phase  to  indicate  special 
conditions  or  operations .   The  minus  sign  is  used  to  indicate  that  the 
following  attribute  is  not  to  be  retained  for  the  word  thus  formed. 
This  attribute  may  or  may  not  have  been  retrieved  for  the  word  under 
consideration.   The  asterisk  is  used  to  indicate  that  a  complete  word 
has  been  placed  in  the  tree  structure  to  that  point  in  the  tree.   It 
is  also  used  as  the  attachment  point  for  any  remaining  attributes  of 
the  word  that  it  terminates. 

(2)   File  Description.   The  Dictionary  File  consists  of 
a  sequence  of  characters  contained  in  a  series  of  sixty-character 
records.   The  file  begins  with  the  left  parenthesis  and  ends  with  a 
right  parenthesis  and  a  dollar  sign  in  the  last  record.  The  first 
character  in  the  sequence  after  the  left  parenthesis  is  the  symbol 
one  (1)  which  is  used  as  an  attachment  point  for  all  of  the  multiple 
definitions .   The  remaining  characters  are  placed  into  the  file  such 
that  the  tree  structure  which  is  built  from  this  file  will  contain 
all  of  the  words  entered  into  the  dictionary  of  the  program. 

At  the  zero  level  in  the  dictionary  tree,  the  twenty- 
six  letters  of  the  Latin  alphabet  are  attached  to  the  preceding 
character  in  descending  order  according  to  their  probability  of  occurrance 
in  the  English  language  [13] .   These  letters  are  used  as  the  first 
character  of  all  words  in  the  dictionary.   Each  of  the  remaining  letters 
in  a  word  is  preceded  by  the  less-than  symbol  to  indicate  that  they  are 
to  be  attached  to  the  previous  letter  in  the  word  at  the  next  lower 
level  in  the  subtree. 
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Where  there  are  attributes  that  are  common  to  a  number 
of  words  with  similar  beginnings,  those  attributes  can  be  included  in 
the  sequence  immediately  prior  to  the  beginning  of  the  dissimilarity. 
Wherever  a  complete  word  has  been  represented,  an  asterisk  is  included 
in  the  sequence  following  the  last  letter  of  the  word.  When  a  word 
being  described  has  a  common  ending,  such  as  the  plural  endings  ES 
and  S,  the  multiple  definition  name  for  that  ending  is  placed  in  the 
sequence  immediately  following  the  slash  symbol,  and  this  multiple 
definition  name  is  used  to  complete  the  word.   Figure  8  illustrates 
the  use  of  the  multiple  definition  name  in  the  sequence. 

S<0<M<E<*  ADVBP»^   E<N<D  <VERBP  *  /MING>»> 

FIGURE  8.   Sample  Dictionary  Record 

b.   Codes  File. 

The  codes  file  consists  of  a  series  of  sixty-character 
records  of  which  only  the  first  twenty  characters  are  utilized.   Each 
record  consists  of  the  topic  name  beginning  in  the  first  character  posi- 
tion and  followed  by  the  five  number  numeric  code  word  in  the  last 
five  character  positions  with  blanks  in  between.   The  records  are  ar- 
ranged in  the  file  according  to  the  hash  coded  index  of  each  item  that 
is  determined  during  the  operation  of  the  TABLE  program  and  its  call 
to  the  subroutine  HASHX  . 
3.   Initialization. 

The  program  has  an  initialization  phase  in  which  the  dictionary 
file  is  read  and  structured  into  the  tree  and  subtrees  required  for 
later  use  in  the  determination  of  the  attributes  of  a  user's  request. 
After  the  dictonary  has  been  structured,  the  table  file  is  read  into 
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an  array  for  use  during  the  search  for  the  numeric  code  words  required 
for  the  information  retrieval  system. 

a.   The  Dictionary  Tree  Structure. 

The  dictionary  tree  structure  is  a  connective  list  which 
has  pointers  in  the  CARF  field  of  the  appropriate  CONCELL  to  point 
to  the  required  symbolic  data  in  the  ATOMLIST.   It  is  formed  by  first 
constructing  the  multiple  definitions  subtree  and  then  constructing 
the  remaining  subtrees,  all  of  which  start  with  the  letters  of  the 
alphabet  and  some  punctuation  characters.  All  of  the  subtrees  are 
constructed  independently  by  the  recursiveness  of  the  structuring 
procedure  (S_EXPR)  until  the  last  subtree  has  been  structured  at  which 
time  the  subtrees  are  connected  to  form  the  dictionary  tree.   The 
recursive  construction  of  the  tree  is  illustrated  in  figure  9  on  page  24. 

Although  the  multiple  definitions  subtree  is  constructed 
in  the  same  manner  as  all  of  the  other  subtrees  in  the  dictionary, 
there  is  a  special  pointer  (L3)  which  is  assigned  to  point  to  this 
subtree  during  the  initialization.   This  assignment  occurs  immediately 
upon  completion  of  the  structuring  of  the  subtree  and  before  the  next 
subtree  is  started.   This  pointer  is  required  during  the  structuring 
of  the  remaining  subtrees  as  a  beginning  pointer  for  the  procedure 
(LOOKLIST)  when  a  search  for  a  multiple  definition  used  in  the  other 
subtrees  is  required.   The  multiple  definitions  subtree  cannot  be  re- 
cursively defined  (i.e.,  a  multiple  definition  which  uses  another  mul- 
tiple definition  as  one  of  its  endings)  since  the  special  pointer  is 
not  assigned  until  the  completion  of  the  entire  structuring  of  the 
subtree. 

During  the  construction  of  the  remaining  subtrees,  the 
multiple  definitions  are  used  whenever  a  slash  is  encountered  during 

23 


1 


01 


01 


(a)  Before  Returning 
From  Recursion 


01 


01 

0 

A 

(b)  Partial  Return 


i 


oo|  oh  I  j — -H  oojoh 


X 


01 


01 


L 

H 


1 


00 

1 

X 

01 


1 


00 

1 

X 

011  (  » 


(c)    Complete  Return 


i 


01 

2 

A 

1 


01 

2 

A 

^V 


01 

2 

>. 

\ 


01 

2 

A 

oo  \2\j\n *\ oo  |2 1 1 


01 

2 

A 

00  |2  h  |  j 100    |2h 


01 

2 

X 

FIGURE  9.   Recursive  Structure  of  Connected  Lists 
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the  scanning  of  the  input  symbols.  When  this  occurs,  the  procedure 
LOOKLIST  is  invoked  to  determine  the  pointer  to  the  subtree  required 
at  this  point  for  the  completion  of  the  word  being  entered  into  the 
dictionary.   The  program  allows  more  than  one  multiple  definition  to 
be  used  as  an  ending  of  a  word  with  the  stipulations  that  none  of  the 
multiple  definition  subtrees  can  begin  with  the  same  letter  nor  can 
the  multiple  definitions  subtrees  begin  with  the  same  letter  as  is 
already  included  in  the  subtree  of  the  word  at  that  level.  The  use 
of  the  multiple  definition  is  illustrated  in  figure  10  on  pages  26 
and  27. 

b.   The  Table  Array 

The  table  file  is  copied  directly  into  the  array  area, 
CODE.   No  processing  occurs  with  the  items  in  the  file  during  the 
initialization  phase  of  the  program.   Currently,  there  is  a  logical 
switch  (true  or  false)  which  has  been  turned  off  (false)  to  prevent 
the  table  file  from  beiri;:  read  into  the  CODE  array.  When  further 
implementation  of  the  proposed  library  search  system  has  been  accom- 
plished, this  switch  can  be  turned  on  to  allow  the  initialization  of 
the  table  array  for  use  in  the  table  search  procedure. 
A.   Processing  of  a  Request 

After  the  initialization  phase  has  been  completed,  the  computer 
will  loop  through  the  remaining  parts  of  the  program  until  the  string 
ST0P$  is  entered  at  the  terminal  by  the  operator. 

a.   Entry  of  a  Request 

The  user  is  requested  by  the  program  to  enter  his  name 
and  student  box  number  (or  other  appropriate  school  mail  box)  to  be 
used  for  identification  purposes.   The  user  is  then  asked  to  enter 
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his  English  language  request  surrounded  by  right  and  left  parentheses 
and  ending  with  a  dollar  sign.   If  the  request  is  too  long  to  be  en- 
tered at  one  time,  the  right  parenthesis  and  dollar  sign  can  be  left 
off  until  the  entire  request  has  been  entered. 

The  request  entered  will  be  built  into  a  tree  (L2)  in  the 
same  manner  as  the  dictionary  tree  where  all  of  the  words  and  punctua- 
tion are  entered  as  single  item.   Figure  11  is  an  example  of  a  typical 
user's  request  tree. 
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FIGURE  11.   Sample  User's  Request 
This  tree  structure  will  then  be  passed  to  the  attribute  determination 
routine  for  processing. 

be   Determination  of  the  Attributes 

In  the  procedure  ATTRB,,  each  word  in  the  user's  request 
is  separated  into  its  constitutent  letters  and  these  letters  are  passed 
one  at  a  time  to  the  LOOKLIST  procedure.   The  ATTRB  procedure  will  also 
pass  to  the  LOOKLIST  procedure  the  subtree,  or  tree  if  the  letter  is 
the  first  letter  of  the  word,  on  which  the  LOOKLIST  procedure  is  to 
search  for  the  letter. 

In  the  LOOKLIST  procedure,  each  letter  is  searched  for  only 
in  the  highest  level  of  the  subtree  passed  to  it.  Any  attributes  which 
are  encountered  before  the  letter  is  found  will  be  attached  to  a  tem- 
porary pointer  (L3)  for  later  combination  with  the  final  attributes. 
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When  the  letter  has  been  found,  the  pointer  contained  in  the  CDRF  field 
of  the  CONCELL  which  points  to  the  ATOMCELL  containing  the  letter  is 
returned  to  the  ATTRB  procedure.   This  pointer  will  be  used  by  the 
ATTRB  procedure  as  the  pointer  to  the  subtree  used  during  the  next 
call  to  LOOKLIST. 

This  searching  will  continue  until  all  of  the  letters  of 
the  word  have  been  found.  After  the  last  letter  has  been  processed, 
the  ATTRB  procedure  will  pass  to  the  LOOKLIST  procedure  an  asterisk 
and  the  latest  subtree.   The  pointer  (Q)  returned  from  this  call  will 
point  to  the  list  of  final  attributes  of  the  word,  if  any. 

At  this  point,  both  the  temporary  attribute  list  and  the 

final  attribute  list  will  be  passed  to  the  DELETE  procedure  for  the 

deletion  of  any  attributes  in  the  temporary  list  which  are  indicated 

as  not  applying  to  the  word  by  the  prefixed  minus  sign  on  the  attribute, 

The  before  and  after  structure  of  the  temporary  and  final  attribute 

lists  are  illustrated  in  figure  12. 
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FIGURE  12.   Temporary  and  Final  Attribute  Lists 
The  temporary  list  and  the  final  list  will  be  connected  after  this 
deletion  has  taken  place,  and  the  resulting  list  will  be  attached  to 
the  word  on  the  user's  request  list  (L2)  as  shown  in  figure  13. 
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FIGURE  13.   User's  Request  with  Attributes  Attached 

As  the  processing  of  each  word  has  been  accomplished,  a 
message  is  displayed  at  the  terminal  to  the  effect  that  the  attributes 
have  been  found.   If  during  the  search  for  the  letter  of  a  word  by  the 
LOOKLIST  procedure,  that  letter  is  not  found,  a  null  pointer  is  returned 
to  the  ATTRB  procedure.  This  null  pointer  will  result  in  the  ATTRB 
procedure  displaying  a  message  at  the  terminal  that  the  attributes  of 
the  word  have  not  been  found,  and  then  the  procedure  will  begin  to 
process  the  next  word  in  the  request. 

After  all  of  the  words  have  been  processed,  the  computer 
prints  out  the  user's  list  which  now  includes  the  attributes  of  the 
individual  words .   The  computer  then  branches  back  to  the  beginning 
and  requests  a  new  user  to  enter  his  name  and  box  number. 

B.   TABLE  PROGRAM 

Since  the  TABLE  procedure  in  the  main  program,  LIB-SER,  was  written 
to  allow  only  the  retrieval  of  the  numeric  code  words  from  the  CODE 
array,  the  auxiliary  program,  TABLE,  was  written.   The  TABLE  program 
allows  the  entry  of  an  item,  the  deletion  of  an  item,  and  the  retrieval 
of  information  contained  in  an  item  in  the  CODE  array. 
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The  TABLE  program  first  initializes  the  CODE  array  by  copying 
directly  into  the  array  the  individual  records  in  the  CODES  file. 
The  records  are  truncated  after  the  twentieth  character.   Once  in  the 
array,  the  program  calls  the  subroutine  HASHX  with  various  parameters 
to  perform  the  functions  of  retrieval,  deletion,  or  storage  of  an 
item  in  the  array.  When  the  required  modifications  to  the  array  have 
been  accomplished,  the  CODE  array  is  copies  back  into  the  CODES  file 
with  each  item  being  padded  with  blanks  on  the  right  to  fill  out  each 
record. 

Although  the  TABLE  program  was  written  to  be  operated  from  the 
time-sharing  terminal,  with  slight  modifications  it  could  be  run  under 
the  batch  processor  at  the  computer  installation.  This  might  be  de- 
sirable if  the  number  of  modifications  is  substantial. 

C.   HASHX  SUBROUTINE 

The  HASHX  subroutine,  which  is  used  to  compute  the  address  of  a 
location  in  a  table  or  array  and  either  return  the  contents  of  that 
location  or  place  an  item  in  that  location,  is  based  on  the  work  done 
in  scatter  storage  techniques  by  W.  D.  MAURER  [14].   The  method  used 
here  is  referred  to  as  a  "division  hash  code"  to  distinguish  it  from 
the  logical  or  multiplicative  methods  most  often  used  in  scatter 
storage  systems  [15]. 

Hash  coding  techniques  can  be  applied  to  any  table  or  array  in 
which  the  access  is  to  be  made  to  the  entries  in  an  unpredictable 
order  and  the  items  are  identified  by  some  key  or  name  associated  with 
their  contents.   In  the  hash  coding  technique,  a  transformation  is 
performed  on  the  key  to  produce  an  address  in  the  table  where  the  key 
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and  the  entry  associated  with  that  key  can  be  placed.   A  good  trans- 
formation is  one  which  will  distribute  these  addresses  uniformly 
across  the  available  table  area. 

In  most  hash  coding  techniques,  the  table  size  is  restricted  to 
being  a  power  of  two,  since  the  common  method  of  obtaining  the  address 
is  to  calculate  a  k-bit  field  which  is  assumed  to  be  a  random  integer 
between  zero  and  2-1,  and  this  integer  is  then  used  as  the  address. 
In  the  method  proposed  by  MAURER  and  used  in  this  subroutine,  the 
table  size  is  a  prime  number,  and  therefore  it  can  be  almost  any  size 
desired. 

1.   Determination  of  the  Hash  Code 

The  hash  coding  method  used  here  consists  of  using  the  first 
eight  characters  of  the  topic  name  as  the  key.   The  first  four  charac- 
ters are  exclusively  OR-ed  with  the  last  four  characters  to  obtain  one 
full  computer  word  (32  bits  on  the  IBM  360/67  computer) ,  and  this 
computer  word  is  divided,  using  integer  arithmetic,  by  the  table  size. 
The  remainder  from  this  division,  which  is  a  number  between  zero  and 
one-less-than-the-table-size,  is  used  as  the  hash  code. 

This  method  of  calculating  the  hash  code  is  comparable  in  speed 
to  the  multiplicative  and  logical  methods  with  the  advantage  that  is 
almost  completely  free  of  the  nonrandomness  which  often  occurs  in  the 
other  methods. 

In  using  this  method  of  hash  coding,  a  change  in  the  table 
size  will  change  all  of  the  hash  codes  for  the  table  entries .   If  the 
table  size  has  to  be  changed,  all  of  the  hash  codes  for  the  entries 
must  be  recomputed.   By  using  a  prime  number  which  is  twenty-five 
percent  larger  than  the  maximum  number  of  expected  entries  for  the 
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table,  the  average  number  of  steps  required  to  find  an  item  in  the 
table  is  less  than  two,  i.e.,  the  original  hash  code  address  and 
one  collision  modification. 
2.   Collision  Handling 

In  this  method,  as  in  all  other  hash  coding  methods,  when  two 
keys  have  the  same  hash  code,  a  "collision"  is  said  to  have  occurred. 
When  a  collision  occurs,  the  item  causing  the  collision  must  be  located 
out  of  place  in  the  table. 

There  are  a  number  of  methods  for  determining  a  location  for 
this  item,  such  as  searching  linearly  forward  in  the  table.  Another 
approach  is  to  add  a  random  number  to  the  collision  hash  code  to  de- 
termine a  new  location.  Neither  of  these  methods  is  very  satisfactory 
since  they  are  intrinsically  slow  if  there  is  any  sort  of  clustering 
of  the  items  in  the  table  (many  items  with  the  same  hash  code  place 
next  to  each  other) . 

The  method  of  handling  collisions  used  in  this  subroutine  is 
the  "quadratic  search".   In  this  method,  a  quadratic  equation  is  used 
to  calculate  the  new  location.   The  hash  code  which  caused  the  colli- 
sion is  used  as  the  constant  term  in  the  equation  and  the  other  coef- 
ficients depend  upon  the  table  size.   The  quadratic  search  was  shown 
by  MAURER  to  always  search  for  the  item  or  an  empty  location  by  looking 
at  exactly  half  of  the  remaining  table  locations  if  the  table  size  is 
a  prime  number.   In  using  this  method  of  handling  collisions,  the  table 
is  declared  full  when  the  quadratic  search  has  searched  through  half 
of  the  table. 
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III.   AN  EXAMPLE 

To  demonstrate  the  operation  of  the  LIB-SER  program,  only  a  repre- 
sentative dictionary  is  utilized.   For  a  total  implementation  of  this 
system,  the  dictionary  would  have  to  be  greatly  expanded.   The  dictionary 
used  for  this  demonstration  contains  seventy  words . 

The  program  is  loaded  for  operation  at  the  time-sharing  terminal 
by  entering  LDRM.   LDRM  is  a  time-sharing  executive  file  which  causes 
the  program  and  the  system  library  to  be  loaded  into  the  computer  and 
starts  the  execution  of  the  program.   If  t    non-blank  character  had 
been  entered  after  LDRM,  the  program  would  request  the  operator  to 
enter  optional  parameters  for  use  by  the  program.   These  parameters  are 
logical  switches  which  can  be  used  to  display  traces  through  the  program 
or  selectively  enable/disable  the  various  routines  in  the  program. 

After  execution  has  begun,  and  any  optional  parameters  entered, 
the  program  initializes  the  dictionary  tree  structure.   The  computer 
will  then  ask  a  user  to  enter  his  request,  which  will  then  be  processed 
to  obtain  the  attributes  of  the  individual  words.   Figure  15  illustrates 
a  complete  run  for  an  individual  user. 

The  difference  between  the  attributes  for  a  word  which  has  been 
processed  with  the  DELETE  procedure  disabled  and  the  same  word  when  it 
has  been  enabled  is  illustrated  below  in  figure  14. 

The  computer  will  then  loop  back  to  ask  a  new  user  to  enter 
his  request.   The  program  can  be  stopped  by  entering  STOP$. 
(INFORM <NOUNP  -NOUNP  VERBP>  )         ( INFORM <VERBP>  ) 

(a)  (b) 

DELTE  DISABLED  DELETE  ENABLED 

FIGURE  14.   Effects  of  DELETE  Routine 
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ENTER  NAME  AND  SMC  NUMBER 
_c.t.  schmidt  box  1962 

ENTER  REQUEST,  SURROUND  WITH  (  )$ 

_(what  do  you  have  on  computers?) $ 

WHAT 

DO 

YOU 

HAVE 

ON 

COMPUTERS 

? 

****************************** 

C.T.  SCHMIDT  BOX  1962 
****************************** 


ATTRIBUTES  FOUND  FOR  WHAT 

ATTRIBUTES  FOUND  FOR  DO 

ATTRIBUTES  FOUND  FOR  YOU 

ATTRIBUTES  FOUND  FOR  HAVE 

ATTRIBUTES  FOUND  FOR  ON 

ATTRIBUTES  FOUND  FOR  COMPUTERS 

ATTRIBUTES  FOUND  FOR  ? 


(WHAT<INTER  ADVBP>  DO  (VERBP>   YOU<PRNOUNP>   HAVE<VERBP  PRN3SG> 
ON<PREPP>   COMPUTERS  <NOUNP  PLUR>   ?  <PUNTI>  ) 

**  AT  THIS  POINT  YOUR  REQUEST  WOULD  BE  PASSED  TO  THE 
TRANSLATION  AND  TABLE  SEARCH  ROUTINES.  ** 


FIGURE  15.   Sample  Program 


35 


IV.   CONCLUDING  REMARKS 

This  paper  has  discussed  a  dictionary  list  structure  which  could 
be  used  by  an  English  Language  translator  to  enable  natural  language 
man-machine  conversation  directed  towards  the  retrieval  of  information 
from  a  data  bank.   The  development  of  the  English  Language  translator 
utilizing  this  dictionary  structure  and  the  inclusion  of  this  translator 
into  the  existing  information  retrieval  system  used  by  the  Technical 
Reports  Section  at  the  Naval  Postgraduate  School  Library  would  provide 
the  school  a  flexible  and  accessible  information  system  for  use  in 
obtaining  a  bibliography  of  the  reports  held  by  the  library. 

A  translation  procedure  which  is  based  on  a  generative  model  of 
syntax  and  semantics  that  is  comprehensive  enough  to  automatically 
resolve  some  forms  of  syntactic  and  semantic  ambiguities  would  be  re- 
quired for  this  system.   The  translator  would  have  to  be  able  to 
syntactically  parse  the  user's  request,  using  the  attributes  determined 
from  a  dictionary  structure  such  as  described  in  this  paper,  and  then 
use  this  parse  in  conjunction  with  the  semantic  environment  of  the 
system  to  determine  the  topic  areas  of  interest  from  the  user's  request. 
In  the  accomplishment  of  this  goals  it  should  have  some  means  of  com- 
munication with  the  user  to  resolve  any  ambiguities  which  it  is  not 
otherwise  able  to  resolve.   Once  the  topic  areas  of  interest  have  been 
determined,  the  system  can  readily  produce  the  numeric  code  words 
required  to  obtain  a  bibliography  from  the  existing  information 
retrieval  system. 


36 


**  LIBRARY    SFARCH    SYSTFM  ** 

**  LIF_SER    PROGRAM  ** 

LIB    SER:     PROC(PARMS)     OPTIONS ( MAT N) : 

DCL    CODES    FILF    STR«=AM    ENVIRONMENT    (MRO)l, 

DICT    FILE    STRE&M    ENVIRONMENT    (c(BO)): 
DCL    PARMS    CHAR(8)VAR,C00E(1U    CHAP(20): 
DCL    (ATOMLIST, P, NIL)  c>TP, FSIZE    PIXEO    BINT 

*  DECLARATION    nc    THE    FIELDS    Oc    THE    CELLS    ON  * 

*  THE     'ATOMLIST'    LIST.  * 

1    ATOMCELL    BASED(P) , 
2    SIZE    FIXED    BIN, 
2    NEXT    PTP , 
2    PNAME    CHAP(CSI7E    RE^ER ( SI ZE) ) , 

*  DECLARATION    hf    the    FIELDS    OF    THE    CELLS    ON  * 

*  LI,    L2,    AND    L3    LISTS.  * 

1  CONCELL  BASED(P) t 
2  ATOMB  BIT(2) , 
2  LEVEL  FIXED  BIN, 
2  CARF  PTR, 
2  CDRF  PTR; 

/A^A^AAA^AAAAAAAA*AAAA^A*A*^ci!t^t5!c**^t3Jc3Je^:3^j!t*^c*^t5)f'55c'*^:^t3^35t>|; 

*  DECLARATION    OP    A    FT LE    CONTROL    BLOCK    USED  * 

*  DURING    THE    READING    AND    WRITING    HF    THE  * 

*  FILES    USED.  * 

DCL    I    FCB    STATIC, 

2    COMMAND    CHAD ( P) ? 

2    FILENAME    CHAR(P)     I  NT T ( • DI CT « )  , 

2    FILETYPE    CHAP(P)     I  NT T (  • DATA • )  , 

2    CARDNUM    FIXED    PIN, 

2    STATUS    FIXED    FIN    INIT(O), 

2    CAPD_BUFF    CHAR( PO) : 

*  PROCEDURE    WHICH    i?    US^n    m    T  NT"r  I  ALt  7E    "<"HE  * 

*  DICTIONARY    AMD    THE    TABLE    ARRAY.  * 

DCL     IN  IT    FMTPY: 
INTT:    PROC; 

DCL    I    PIXED    PIN: 

ATOMLIST, NIL=NULL: 

FLD    =     ' 1 «B  ; 

COMMAND    =     'POPUP': 

DISPLAY(  'INITIALIZING    FROM    FILE'): 

LI    =    READS: 

COMMAND    =     'FINT  S'  : 

CALL    IHEFTLE(FCP) : 

FLD    =     'O'P : 
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IF    TRI    THFN    CALL    PR  I  NTS ( CDR ( LI ) )  : 
IF    TBL    THEN    DO; 

COMMAND    =    'RDRUF  «  ; 
FILFNAME    =    »CnnES' : 

DO    I    =   1    m    trlsi  7E; 
CARDNUM    =    T  : 
CALL    IHEFTLE(FCB)  : 
COHE( I)     =    CARD    BIJPC  ; 

IF    TRI    THFN    DISPLAY! ' CnDE (  '  I  |T  |  I  •  )        »|! 
CODE (I ) ) ;    ENH; 
COMMAND    =    'PINTS': 
CALL     IHEFILE(FCB) :    END: 
DISPLAY(,CND    OF    INITIALIZATION*); 
END    IN  IT; 

/************************************************** 

*  PROCEDURE    WHICH    IS    USED    TO    ENTER    ITE^S  * 

*  INTO    THE    PROGRAM.  * 
**************************************************/ 

DCL  INPUT  ENTRY(CHAP(72) ) : 
INPUT:  PROC(R): 

DCL  B  CHAR(72),(S  CHAR(8),I  FIXCD  RTN  I NIT (0 ) ) ST  AT IC: 
IF  FLD  THEN  DO; 

I  =  I+l: 

CARDNUM  =  I; 

CALL  IHEFTLE(PCR) : 

B  =  CARD  BUFF: 

IF  tri  THEN  DI  SPLAY( 'BUFPE0   'MB):  END: 
ELSE  DISPLAY!  'ENTER  STRING,  SURROUND  WITH  (  )*') 

REPLY(B) : 
END  INPUT; 

/************************************************** 

*  PROCEDURE    WHICH    IS    USED    TO    PRINT    OUT    ITEMS         * 

*  FROM    THE    PROGRAM.  * 
**************************************************/ 

DCL  OUTPUT  FNTRY(CHAP(*) ) : 
OUTPUT:  PROC(C): 

DCL    C    CHAM*), I     FIXED    BIN    I  NIT  (0)  STATIC  ? 

IF  FLD  THFN  DO: 

I  =  I+l; 

CARDNUM  =  I; 

COMMAND  =  'WRRIF  ■ : 

CALL  IHEFILE(FCB) :  END: 
ELSF  DISPLAY(C) : 
END  OUTPUT; 

/************************************************** 

*  PROCEDURE    USPD    TO    DETERMINE    tHF    FUNCT!ON  * 

*  OF    THE    CELL    SPECIFIED    BY    THE    POINTER.  * 
**************************************************/ 

DCL    ATOM    ENTRY(PTR)     RE TUPN S ( BI T( 2) ) : 
ATOM:    PROC (A)    BIT( 2) : 

IF    A  =  NIL    THEN    R^ TyR N(  '  01 ' B)  : 

RETURN! A->ATOMB) : 

END    ATOM; 

/***** ********** ***************************** A***** 

*  PROCEDURE    WHICH    CONSTRUCTS    A    CHARACTER  * 

*  POINTFR    CELL     (A    CELL    WITH    ETTHFP     A    11    OR    01  * 

*  ATOMB    FIELD)     AND    tnjccrjs    THC    POINTER  * 

*  TO    THE    CHARACTEMS)     ON    THC     'ATOMLIST»     LIST  * 

*  IP    ALPEADY    ON    ^H6    LIST    OR    ELSE    PLACES  * 

*  the    CHARACTERS)     ON    thp     'ATO^LTST'     LI^T  * 

*  AND    INSERTS    THE     POTNTEP    TO    THIS     NEW    CELL.  * 
**************************************************/ 

DCL     STRING    ENTRY(CHAR(*)v/AR)     RETURNS  (PTQ  )  : 
STRING:    PRCC(C)    DTP : 
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DCL    C    CHAR(*)VAR , (0    &TR,L    ^TXcD    BTN)STATIC: 

IF    TRC     THEN    0! SDL AY ( « STPI NG     '!|C!|«     AT    LEVEL'llLFV) 

0    =    LOOKUR(C) : 

ALLOCATE    CONCELL    SET(D): 

L  =  LENGTH(C) : 

IF  L>1  THEN 

IF    FLO    THFN    ATOMB    =     'II «R: 

ELSp    ATOMB    =    «  01  •  8  : 
ELSE    ATOMB    =    '01 'P: 
CORF    =    NIL; 
LEVEL    =    LEV; 
IP    0    =    NIL    THEN    00; 

CSIZF    =    L; 

ALLOCATE    &TOMCELL    S«=T(0): 

0->NEXT    =    ATOMLI ST; 

0->PNAME    =    C ; 

ATOMLIST    =    0;    ENn; 
CAPF    =    0; 
RETUPN(P)  ; 
END    STRING; 

*  PROCEDURE    WHICH    SEARCHES    THROUGH    THE  * 

*  'ATCMLIST'    LIST    to.    fIND    the    rf|_L    LOCATION  * 

*  OF    THE    INPUT    CH AP ACTER ( S) .  * 

DCL    LOOKUP    ENTRY(CHAR(*)VAP)     RETURNS ( PTR ) ; 
LOOKUP:     PROC(C)    PTR; 

DCL    C    CHAP(*)VAR,Q    PTR    STATIC; 
0    =    ATOMLIST; 

DO    WHILE(0-.  =  NIL)  : 
IF    0->PNAMF    =    C    THEN    RETURN(O): 
0    =    0->NEXT;    END; 
RETURN(NIL) ; 
FND    LOOKUP; 

*  PROCEDURE    WHICH     SCAMS    THE    IMPUT    R'JPFER  * 

*  AREA    AND    PETURNS    VALID    CHARACTER    STRINGS  * 

*  WITHOUT    RLANKS    TO    the    CALLING    POINT.       THE  * 

*  VALID    CHARACTER    SWINGS    ftRE    SINGLE    SPCCTAL  * 

*  (EXCEPT       •-«)     AND    ALFANUMCRK    STRINGS  * 

*  INCLUDING    THE     •-'     CHARACTER.  * 

DCL    SCAN    RETURNS(CHAR (80)VAR) ; 
SCAN:    PROC    CHAR(80)VA°: 

DCL     UBP    INITC72) ,B)FIXED    BIN,(T    CHAM1), 

BUFF    CHAR(72) ) ) STATIC; 
B    =    0; 

DO    WHILc(B=0) : 

IF    BP    >=    72    THEN    HP: 

BP    =    0:    CALL    INPUT(BUFF):     END: 
BP    =    BP+1: 

IF    SUBSTR(BU"  ,BP,l)-»='     '    THFN    B  =  BP;     END: 
T=SUBSTR(BUPP ,B,1 ) : 
IP    T    <     'A«     E    T    -.=     »-'     THEN    R«=TURN(T); 

DO    WHILE(BP<72    G    T    -,=  «     •     £     (T    =  »-•     |     T    >='A')): 
BP    =    BP  +  1:     T=SUBSTR(PUFF,RPtl  )  ;    END: 

RETURN( SUBSTR(BUPF,8,BP-8+l) ) ; 
END    SCAN: 

/  sic  s}:  £  dc  ^t  #  :£  jfc  £  #  #  3}c  £  j$c  if  if  A  jjt  if  if  if  if  if  if  if  $t  £  $  *  $  3$:  s}-  if  #  ■%.  if  if  if  if  $  if  if  iz  if  if  if  if  if  it  if 

*  PROCEDURc  WHirH  CONSTRUCTS  A  CONNECTIVE       * 

*  CELL  (A  CELL  WITH  A  00  ATOMB  FIELD)  * 

*  WITH  THE  INPUT  POINTEPS  AND  THE  * 

*  CURRENT    LEVEL.  * 

DCL    CONS    ENTRY(PTR,PTR)     RETURNS ( PTR) : 
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CONS:    PROC(A,B)    PTR: 
DCL    (ATB)PTR: 
ALLOCATE    CONCELL    SET(P): 
ATOMB='00'R; 
LEVEL    =    LEV: 
CARF=A  : 
CDRF=B: 
RETURNM  P)  : 
END   CONS: 

/*^********^***  ***************  ********************** 

*  PROCEDURE    WHICH    RETURNS    THF     POINTER  * 

*  CONTAINED    IN    THE    CARF    FIFLD    OF    tHP  * 

*  SPECIFIED    CELL.  * 

Ajjc^c^c  +  ^sjc^c^c^t^c^^t^csfc^cjjeAj^jjcjftjJtsjfAjjt************  ******  *******/ 

DCL  CAP  ENTRY(PTP)  RETURNS ( PTR ) : 
CAR:    PROC(A)    PTR: 

DCL    A    PTR: 

IF    A  =  NIL    THEN    RETURN(NIL): 

IF    A->ATOMB    THEN    PETURN(NIL): 

RETURN (A->CARF) : 

END    CAR : 

*  PP0CEDIJPC    WHICH    RETURNS    THE    POINTER  * 

*  CONTAINED    IN    THE    CDRP    FIELD    PF    THE  * 

*  SPECIFIED    CELL.  * 

*$**&$$*******************************************/ 

DCL    COP     ENTPY(PTR)     RETl)RNS  (  PTR  )  : 
CDR:    PROC(A)    PTR: 

DCL    A    PTR: 

IF    A=NIL    THEN    RETURN(NIL); 

RETURNf A->CDRP) : 

END   CDR : 

*  PROCEDURc    WHICH    RETURNS    THE    POINTER  * 

*  CONTAINED    IN    THE    CARF    FTELD    OF    THE    CELL  * 

*  POINTED    TO    BY    THE    CARF    FIELD    OF    THE  * 

*  SPECIFIED    CELL.  * 

r^;  3^c  ajc  a^»;  j^c  ^  ^r  s^c  i^:  3^c  3^.  ^t  5^c  ^i  s^r  a^f  ^fr  ^  X1-  ^t"  ^  -^  ^t^  ^^  ^  ^  ^  ^fc  5jt  sjt  a^  ^t:  a^t  5*t  ift  3^c  3^:  5^c  5^:  ^Jc  ^  3^c  ^<  s^t  >Jr  ^4^  ^=;  5r*r  5^  >^r  ^ 

DCL  CAAR  ENTRY(PTR)  RE TURNS( PTR ) : 
CAAR:    PROC(A)    PTR: 

DCL    A    PTR: 

IF  A  =  NIL  THEN  RETURN(NIL): 

A  =  A->CARF: 

RETURN(A->CARF) ; 

END    CAAR: 

/jfcsfc£#5fc*******  +  ***:fr****5£******  :&£:$:*****:$:*£*********** 

*  PROCEDURE     WHICH    WILL    PRINT    OUT    ALL    pF    THF  * 

*  INDIVIDUAL    CHARACTERS    ON    THF    LIST    SPECIFICD.    * 

DCL    PRINT    ENTRY(PTR) : 
PRINT:    PROC(A)    RECURSIVE: 
DCL    A    PTR: 
IF    A-=NIL    THEN 

IF    A->ATOMB    THEN    CALL    OUTPUT ( SYMBOL ( A) ) : 
ELSF    DO: 

CALL    PRINT(  (A->CARF)1  : 
CALL    PRINT( (A->CDRF) ) :    ENP: 
END    PRINT: 

*  PROCEDURE    WHICH    RETURNS    THF    CHARACTERS  * 

*  CONTAINED    IN    THC    PNA^E    FI^LD    <^f    the  * 

*  ATOMCELL    SPECIFIED  * 

*****it****************#******^********j!e^**  *********/ 
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DCL     SYMBOL    ENTRY(PTP)     RETUPNS(  r~H  AR  (  «0)  VAP  )  : 
SYMBOL:     PROC(X)    CHAR(80)VAP: 

DCL     (X»Y)PTR: 

I^    X=NIL    THEN    RCTUPN(  'NT  L1)  : 

Y  =  X->CARP: 

RETURNK  Y->PNAME)  : 

END    SYMBOL; 

/************************************************** 

*  PROCEDURE    WHICH    IS    USED    TO    CALL    TH<=  * 

*  PROCEDURE    TO    FORM    THE    LIST    STRUCTURE.       IT  * 

*  IS    ALSO    USED    T0    ESTABLISH    A    POINTER    tq    THE  * 

*  MULTIPLE    DEFINITIONS    LIST    DURING    THC  * 

*  THE     INITI ALI 7ATION    PHASE.  * 

**************************************************/ 

DCL    READSS    ENTRY    RFTURNS ( °TP  )  ; 
READSS:    PROC    PTP    RECURSIVE: 

DCL     ((0,P)PTRtCK    FIXED    *I  N    I  NIT (0) ) STATIC: 
ERR    =     'O'B:    CK    =    rK+l ; 
0    =    S    FXPR(  'O'B)  : 

IF    ERR"    THEN   CALL    OUT»UT(MLL    FORMFQ    EXPRESSION'): 
IF    CK    =    1    THEN    DO: 
PP,L3    =    0: 
I*    TRI     THEN    Dn: 
P     =    CAAP( (L?) ) : 

DISPLAY( 'FIRST    ITEM    ON    MULTIPLE    DEFINITION    • 
I  I  'LI  ST      «|  |P->PNAME) :    END: 
CK    =   CK+1:    END: 
prr    =    »C'R;    CK    =   CK-i : 
RETURN(O) : 
END    READSS: 

/************************************************** 

*  PROCEDURE    WHICH    IS    USED    TO    CALL    THE    LEV«=L  * 

*  ZEPC    PROCEDURE    WHTCH    BUUOS    TH<=    LIST  * 

*  STRUCTURE    AND    CONTROLS    the    DURATION    Oc  * 

*  THE    BUILDING.  * 

DCL    READS    ENTRY    RETURNS* PTR) : 
READS:     PROC    PTR    RECURSIVE: 

DCL    0    PTR    STATIC: 

ERR    =     'C'B: 

0    =    S    EXPP( «1«B) : 

IF    ERF    THEN    CALL    OUTPU"M»ILL    FORCED    EXPRESSION'): 
DO    WHILE(SCANi=« $« ) :    ENH; 

FRP    =    'O'B: 

RETURN(O)  ; 

END    READS: 

/******  *************  >'********  ********************** 

*  PPOCEDURc    WHICH    BUILDS    THC     'LISP    LIKE'  * 

*  STRUCTURE    OF    THE    LISTS    USED    BY    THE    PROC-PAM.       * 
************************** ***************** *******/ 

DCL    S    EXPR    ENTRY(BITfD)     RETUP  NS  (  PTR  )  : 
S    EXPR:     PROC(DOT)     PTR    RECURSIVE: 

DCL    DOT    BIT( 1)  ,(0,R)PTR t (T, SYMKHAP (BO)VAR    STATIC: 
DCL    PIGHTSIDE    RETUR NS ( ptr) ; 
RIGHTSTDE:     PROC    PTP  : 
DCL    P    PTR: 

IF    DOT    THEN    PETURN(O) : 
R=S    EXPR( 'O'B) ; 
IF    FRR    THEN    R^TURN(NTL): 
LEV    =    0->LEVFL: 
P    =   0->CAPc: 
IF    TRC    THEM    DI SPLAY (' LEFT    CONNPCTTVF    SYMROL       • 

1  |P->PNAME  I  I  '        AT    LEVEL'MLEV); 
IF    R    -,=    NIL    THEN 

IF    R->LEVEL    =    LEV+1    then    DO: 
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0->CDPF    =    R: 

p    =   s   fxpr  (  '0'  P)  :    ENo: 

RETURN(CONSTOtP)  )  : 

ENn   Pir:,HTSTDE: 
T=SCAN; 
IF    T='(  •     THEN    DO: 

0=S    FXPR(  'O'B)  : 

IF   Frr    THEN   return(nil): 

RETURN(RTGhTSIDE)  ;    CND: 
IF    T    =     •<»     THEN    DO: 

LEV    =    LEV   +1; 

RETURN! PEADSS)  :    END: 
IF    T    =  »>•     THEN    nn: 

IF    DOT    THEN    ERR    ='1 ■ B: 

ELSE    RETU»N(NI  L)  :    END; 
IF    T= ' ) «    THEN   DO: 

IF    DOT    TH=N    CRR     =     '1  'B: 

ELSE    RFTURN(NT  L)  ;    END: 
IF    T    =    « /«    THEN   on: 

SYM    =    SCAN; 

L3    =    PD; 

ENC    =«1'B: 

0  =  CAR(L00KLTST(L3TSYM) ) ; 

ENC    =«0'B: 

R    =    S    EXPR(  'O'B)  ; 

RETURT\(CONS(0,P) ) :    END: 
ELSE    DO; 

IF    T    =     «NT  L'     THFN    0    =    NIL: 

ELSE    0    =    STRING  (  T)  : 

RETURN(BIGHTSIDE) :    END; 
RETURN(NIL)  : 
END    S_EXPR; 

/A************AA*****  *  A*  *  A*  *  A  AAA*  A**  A*  AAAAAAAAA**** 

*  PROCEDURE    WHICH    IS    USED  '  TH    CONVFRT    TH«=    ITE^C    * 

*  IN    THF    LIST    STRUCTURE    INTO    A    FORMAT    CAPABLE      * 

*  OF    BEING    PRTNTED    out.  * 

**  A  A****  AAA  ***  *  ***  **  A  **  *  **  ****  ****  ****  A A****  AAA A**  / 

DCL    BUILD    ENTRY(PTR,BIT(1) )     RETURNS (CHAR (1000) VAR ) : 
BUILD:    PROC(PtB)    CH AR ( 1 000) VAP    RECURSIVE: 
DCL    P    PTP,B    BIT(1 ) ; 
IF    P=NIL    THEM    PETUR N( ' NI L • ) : 
IF    ATOM(P)     THEN    DO: 

IF    P    -=NIL    6    P->CDPF    i  =  NTL    THEN    RCTURN  ($y  MBOl  ( P ) 

I  I ■<« I !BUILD( P->CDRPt 'l'B)l!'>'); 
ELSE    RETURN(  SYMBOL(  P)  )  :    CND: 
IF    ATOM(CDR(P))     THEN    00: 
IF    CDP (P)     =    NIL    THCN 

IF    B    THEN    RFTURN(B(JILD(CAR(P)  ,  »0'B)  ) 
ELSE    RETURN(  ' (  »  !  I  BUI LD( C AP ( P)  , •! ' P) j|  '  )  ■  ):     END: 
IF    B    THEN    RETURM(BUILD(CAR(P) t 'O'B) 

BUILD(CDP(P) t '1 «B) ) : 
RETURN  (  «(  •  HBUILD(P,'1'B)  I  I  ')  •)  : 
END    BUILD; 


,  ;  !  «  P 
IP    'I 


/A AAA*** AA***************************AAAAAAAAAAAA** 

*  PROCEDURE    WHICH    IS    USED    TO    CALL    TH^  * 

*  PROCEDURE    WHICH    IS    USED    T0    PR*NT    OUT    THF  * 

*  ITEMS    STRUCTURED    *Y    THE    &UILD    PROCEDURE.  * 
AA*A* ********** A* A A ******* ************** A A A*******/ 

DCL    PRINTS    ENTRY(PTR) ; 
PRINTS:    PROC(A); 

DCL    A    PTP,B    CHAR  (1.000)  VARt  (L,I)CTATIC    FIXEO    RIN: 

DISPLAYt 'FORMATING    LIST    FOR    printing*): 

IF    TRI    THEN    DISPLAY( 'DICTIONARY    PRINTING    TAKES    • 

I  I  «A     LONG    TIMc  i  )  • 
B    =    BUILO(A,  'O'B) : 
L    =    LFNGTH(B) ; 

DO    1=1    TO    L    BY    60: 

CALL    OUTPUT( SUBSTR(B,I , MI N(60 , L-T+l)  )  ) ;     END: 
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IF    FLD    THEN    CALL    OUTPUT (•$•) ; 
END    PP INTS: 

*  PROCFDUPc    WHICH    TS    USED    Tp    SEARCH    DOWN    A  * 

*  SPECIFIED    LIST    TO    FTND    THE    CHARACTER (S)  * 

*  SPECIFIFH    BY    thp    PARAMETER.       THF    PPINTEP    TO  * 

*  THE    NEXT    ITFM     'DOWN'    TH^    LI  ^T    *S    RETURNED  * 

DCL    LOOKLIST    ENTRY ( PTP , CH AR ( *) VAP )     PFTURNS ( PTR ) : 
LHOKLIST:     PROC(P,C)     PTP: 

DCL    (P,OtR)PTRtC    CHAP(*)VAR; 
DO    WHILE    (P-.  =  NIL)  : 
0    =    P->CAPF; 
R    =    CAAR( ( PJ ) : 

IF    R->PNAME    =    C    THEN    RE  TURN ( 0->CDR F) : 
ELSE     IF    -iENC    THEN 

IF    0->ATOMR    =     «U«B    THEN 
IF    L3    =    NI  L    THEN    D^: 
L3    =    CONS(Q,NTL) : 
L?->LEVEL    =    1  ;    END; 
ELSE    DO: 

L3    =    CONS(Q,L3) : 
L3->LEVEL    =   1 :    END: 
P    =    P->CDPC :    BP   =    P;    END: 
RETUPN(NIL) : 
END    LOOKLIST: 


*  PROCEDURE    WHICH    IS    USED    TO    ATTACH    TH    &LL  * 

*  ITEMS    ON    THP     'L2'    L^ST    (THF    USER    DFFINED  * 

*  LIST)     ANY    ATTRIRUTES    OF     THAT    TTEM    FOUND    ON  * 

*  THC     'LI'     (DICTIONARY)     LIST.  * 

DCL    ATTRB    ENTRY: 
ATTRB:     PROC: 

DCL    (P,Q,R,S,T)PTR,C    CHAR (20) VAP ,L    FIXED    «IN, 

CI    CHAR( 1) : 
R    =    L2:    L3    =    NIL: 

DO    WHILE(R    -=NTL) : 

P    =    LI: 

S    =    R->CARe; 

T=CAAR( (R) )  : 

C    =    T->PNAME: 

L   =   t->size; 

DO    I    =1    TO   L: 
CI    =    SUBSTP(C,I ,1) 
0    =    LOOKLTST( RtC! ) 
IF    0    =    NIL    THEN    DO 

DISPLAY( 'ATTRIBUTE    NOT    FOUND    FOR    **    MIC): 

GO   tq    LB1 :    END: 
ELSE    P    =    0:    END 


*')  ; 

S->CDRF    =    L3 


0    =    LOOKLI ST(P t 
0    =   DELETP(O) : 
IF    0    =    MIL    THEN 
ELSE    DO: 

IF    L3    -.=    NIL    THEN    DO: 
S-XDPF    =    L3: 

DO    WHI  LE(  L3->CDRF-,=  N!L) 
L3    =    L3->CDRP:    END: 
L3->CDPP    =    0:    END: 
ELSF    S-XDRP    =    0:    ENn: 
DISPLAY(  'ATTRTBUTES    FOUND    C0D 
LBl:    R    =   P->CDRF:    L3    =   NIL:    END: 
END    ATTPB: 


I  !C) 


*  PROCEDURE    WHICH    T ^    USED    TO    DCL"TE    cROy    THE         * 

*  LIST    OF    ATTRIBUTES    THOSF    ATTRIBUTES    WHICH  * 

*  WERE    PICKED    UP    DUPING    THE    SFARCH    WHICH    DO  * 
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*  NOT  APPLY  TO  THE  ^ECIFTC  ITEM.  * 

**************************************************/ 

DCL    DELETE    ENTRY(PTR)     RETURNS! PTR) : 
DELETE:     PROC(0)PTR; 

OCL    <P,Q,R,S,T,BQ)PTR,C    CHAR(20)VAR: 
BO    =    0; 

DO    WHILE*0    -.=    NT  L)  : 
S    =    CAAR( ( 0) ) : 

IF    SUBSTR(S->PNAME,1 ,1)     =    •-'    THEM    DO: 
BO    =    0->CDPF; 
C    =    SUBSTR( S->PNAMET2) : 
P    R     —    L3  * 

DO    WHILF(P    --=    NIL)  : 
T    =   CAAR( *R)  )  : 
IP    t_>p^amp    =    c    THEN    DO: 
T    =    R->CDRP: 
TP    p    =    R    THEN    L3    =    T: 
ELSE    P->CDRF    =    T; 
GO    TO    LB?:    END: 
PLSP    R    =    R->CDRF:    ENO: 
LB2:    0    =    BO:    END: 

ELSE    PETURN(RO) :    END: 
RETURN(BO) ; 
END    DELETE; 

/************************************************** 

*  PROCEDURE    C0P     THE    ON    LINE    ENTRY    OF    ITEMS  * 

*  INTO    THE    DICTIONARY    LIST    STRUCTURE.  * 
**************************************************/ 

CCL    DEFINE    PNTRY; 
DEFINE:     PROC    RECURSIVE: 
DCL    CI    CHAR(l) : 
DISPLAY(  'ENTER    WHRD    POR    DICTION&PY    ENTRY') 

RPPLV(IPT) ; 
L    =    LENGTH* IPT) ; 
BP,L2    =    LI: 

DO    I    =    1    TO   L: 
CI    =    SUBSTR(TDTtI ,1) : 
L3    =    L00KLIST*L2,C1) : 
IF     I    =1    THEN    PP    =    L3: 
IF    L3    =    NI  L    THEN    HP; 

IP    BP    =    L2    THCN    !PT    =    SUBSTR *  I PT ,1  , 1 ) : 

ELSE    IPT    =    SUBSTRUPT, 1,1-1)? 

DISPLAY* 'LETTERS    ALREADY    IN    DICTIONARY    '|!lt>T) 

CALL    PRINTS(PP)  : 

DISPLAY* 'PNTER    REMAINDFP    nc    WOPD    AND    ' 

I  I  'ITS    ATTRIBUTES    IN    DICTIONARY    FORM'): 
BP->CDRF    =    READS:    END: 
ELSE    *P,L2    =    L3:    END: 
CALL    PRINTS*  PP)  : 
DISPLAY* 'DICTIONARY    COMPLETE    =    0»    ELSE    1') 

REPLY(IPT) : 
FLD    =    IPT:    IPT    =    "  : 
IF    FLD    THEN    CALL    DEFINE: 
END    DEFINE? 

/************************************************** 

*  PROCEDURE    WHICH    TS    USPD    TO    CALL    THE    H^SHX  * 

*  SUBROUTINE    TO    RETRIEVE    THE    NUMERIC    Cnnc  * 

*  WORDS    ASSOCIAtpo    WITH    THE    TQPTC    NAME.  * 

****************************************** A* ****rf«*/ 

TABLP:    °Pnc*C); 

DCL    C    CHAR(?0)i     TBLCODE    FIXED    BIN    INTT*0); 

CALL    HASHX(C,CnnE t5,l ,1 , TBLCOPE) : 

IP    TBLCODF    =    0    THcm    DISPLAY*'       'OP^C    NAME    AND    ' 

|j  'NUMERIC    rpr>E    WHRn    **    '||C): 
ELSE    DISPLAY*'       **    TOPIC    NOT    IN    ""ABLE    **       MIC): 
END    TABLE: 
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*  DECLARATION  HP  THF  OLORA.L  VARIABLES  USED      * 

*  AND  DISPLAY  Oc  COMMAND  REQUESTS  TO  THC        * 

*  OPERATOR.  * 

DCL     (Ll»L2,L3,RPtPP)PTR,STNAME    CHAR  ('-0  )  ,  I  PT    CH AR ( 80) V AR , 

(TRCtENC  tTRIT^RP,cLD,TRLTDCT)RIT(l)T 

( !,LtLEV,TRLSI7E)PI XED    RIN: 
TRCtTRI »ENCtERR,cLDtTRL,DCT=«0«R:    LEV=0:    TPL«I7E=11: 

*  INITIALIZATION    OF    the    DICTIONARY    LIST     'LI'.       * 

IF    PARMS    -«=     ''    THEN    HI  SPLAY(  '  ENTFR    OPTIONAL    PAP AMFT FR* ' ) 

REPLY( IPT) : 
yp    ipT    -,=     ii    then    GET    STRING(IPT)     DATA  ( TRC  ,TR  I ,  FLD, 

TBL    DCT):    IPT   *    ■ • ; 
CALL    INIT; 

**         ROUTINE    C0R    THE    OPERATION    OF    THE    PROGRAM.         ** 

DO    WHILE(  'l'R)  : 

DISPLAY( «     ')  : 

DISPLAY*  'ENTER    NAME     AND    SMC    NUMRFRi)     REPLY (ST NAMP) ; 

DISPLAY( '     •)  : 

IF    TRI     THEM    DI  SPLAY(  'FNTER    OPTIONAL     PARAMETERS,'     !! 

•IF    NONE    THEN    RETURN')     RERLY(TPT): 
IF    IPT    -,=     '•     THEN    GET    STRINGUPT)     D AT A* TRC, ENC, 

TRI  ,FLDtTRL,DCT)  :    IPT    =    «  •  : 
IF    DCT    THEN   CALL    DE^I NE : 
ELSE    DO; 

L2=PEADS; 

CALL  PRINTCL2) : 

IF  AT0M(L2)  THEN  IF  SYMR OL ( L2 ) = • STOP •  THEN  EXIT; 

DISPLAY*'     •); 

DISPLAY(  '  ft****** ***#***)((***)!(********** '  ); 

DISPLAY*  •  • II STNAMF) : 

DI  SPLAY(  '  a**********^**************:***  i  )  • 

DISPLAY* '     •)  ; 

CALL    &TTPR; 

DISPLAY*'     •); 

CALL    PRINTS(L2) ;    END? 
IF    TRL    THFN    CALL    TABLF(«    **       TOPIC    NAM=       **     i); 
DISPLAY*  •     •) : 
DISPLAY*  •**    AT    THIS    POINT    YOUR    REQUEST    WOULD' I  I 

'    BE    PASSED    TH    THE    • ) : 
DISPLAY*'  TRANSLATION    AND    TARLE    SEARCH    '|| 

•ROUTINES.    **' ) : 
DISPLAY* '     • ) ;    END: 
END    LIR    SEP: 
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**  TABLE    PROGRAM  ** 

TABLE:    PROC(PARMS)     OPTI  ONS( MAI N)  : 

DCL    CODES    FILE    STRFAM    ENVIRONMENT    (F(«o)): 
DCL     (TRCtSWT)BIT(l) ,( CODE (11) t ALFA) CHAR (20) , 

< I,DATASIZE,NPKFY,TBLSI 7E,TYP0PtTBLC0DE)FTXFD    BIN, 

(PARMS, IPT)CHAR( 2)VAR : 

*  DECLARATION    OF    A    PILE    CONTROL    BLOCK    FOR  * 

*  USF     IN    THE    RE£DIN    OF    THE    EXTERNAL    FILE.  * 

#*%%***# Z************* ***********  *****************/ 

DCL     1    FCB     STATIC, 

2    COMMAND    CHARC  B)  . 

2    FILENAME    CHAR<8)     I  MI  T(  '  CODES  •  )  , 

2    FILETYPE    CHAP(P)     I  MI T( • DATA' ) , 

2    CARDNUM    FIXED    PIN, 

2    STATUS    FIXED    PIN, 

2    CAPD_BUFFER    CHAP(PO): 

/%********■%*%************************************** 

*  PROGRAM    HPFR&TION  * 
******%**%;* *&&* **■%;** **&■%*****$:*****&**************  / 

IF    PAR  MS    -t=     ■«    THEM    TPC    =    »1'B: 
ELSE    TRC    =    'O'B: 
TBLSIZE    =    Us     TBLCODE    =    0; 
DATA  SIZE    =    5;    NPKEY    =    Is 

/^^.^^^■^^k  9^*^************^****  ********  *************** 

*  ROUTINE    TO    READ    THE    CODES    FILE    INTO    THE  * 

*  ARRAY    FOR       USE    DUPING    THE    PROGRAM.  * 

DISPLAY( 'INITIALIZATION    OP    THE    CODE    ARRAY'): 
COMMAND    =     'RDBUF' ; 

DO    I    =    1    TO    TBLSIZE: 

CARDNUM    =    I : 

CALL    IHEFILE(PCB) : 

CODE( I  )    =   CARD    BUFFER: 

IF    TRC    THFN    DT5PLAY( 'CODEf ■  I  II  I  I  •)        ' II  CODE ( I ) ) S     END: 
COMMAND    =     'FINIS' : 
CALL     IHEFILE(FCB) s 
DISPLAY( 'COMPLETION    OF    INITIALIZATION'): 

/************#&*****1lt4c*iZik$t***ikili*2c*1if;*i!L$:&4:  •/&%%*  ***%%*■$. 

*  ROUTINE    WHICH    ALLOWS    THE    CONTTNUAL    STQRF,  * 

*  DELETION,    or    RETRIEVAL    OF    IT^MS    FROM    THF  * 

*  ARRAY    UNTIL    THE    OPERATOR    1^    FINISHED.  * 

**&*&**&********><**************************;**£*!****/ 

SWT    =• l'B; 

DO    WHILE( SWT) : 

DISPLAY( 'ENTER    TYPE    OF    OPERATION:    STORE=0 , R ETP I EVE=1 ' 

I  1  ',DELElr=-l'  )     P-PLYUPT)  : 
TYPOP    =    IPT; 

DISPLAY* «ENTCR    ITEM    REQUIRED')     REPLY ( ALCA) : 
IF    TPC    THEN    DISPLAY* 'CALL    HASHXMs 

X  ^c  sft  sfc  ^;  j^"  ^c  "^c  jfe  ijr  ijc  aj:  5^c  afc  ^c  ^c  si  ajt  >Jc  ;^f  ^Jc  ^  ?fr  jjr^ftf^sjc^c^^cifc^y'^ca^i^'-  ay^c^ksfe^^trrfc  5^c  a5r  ^A:  ^c  >h  3^:  a^  5^ 

*  THE    FOLLOWING    ARE    THE    PARAMETERS    REOUIPFD:  * 
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*  ALFA    =    ITEM    UNDER    CONSIDERATION  * 

*  CODE    =    THE    TABLE    ARRAY    ADEA  * 

*  OATASI7E    =    NUMBER    np    COMPUTFR    WORDS     IN  * 

*  THE     ITM    BEING    ("ONSTDFPED  * 

*  NRKCY    =    NUMBER    OF    ITEMS,     NORMALLY    ONE  * 

*  TYPOP    =    TYPE    OP    OPERATION  * 

*  TBLSI7E    =    THE    TABLC     SIZE     (PRIME    NUMBED)  * 

*  TBLCOOE    =    THE    TABLE    CONFIGURATION    CODE     (0)  * 

CALL     HASHX( ALFA ,C ODE ,DAT ASI 7E , NRK^Y ,TYPOD ,T BLS I ZE, 
TBLCODE)  : 

*  THE    FOLLOWING    *RC    TH=    TABLE    CHDES    WHICH  * 

*  RESULT    FROM    THE    CALL    TO    HASHX:  * 

*  0    =    OPERATION    SUCCESFULLY    COMD|_ETED  * 

*  1     =    TABLE    IS    FULL  * 

*  2    -    ITFM    TS    N0T    IN    THE    TABLE  * 

DTSPLAY( 'TABLE    CODE     'IJTBLCOnE): 

IF    TYPOP    =    1    THEN    DI SPL AY( ' RETPI E VED    ITEM    AND    CODE: 

I  IALFA  )  ; 
DISPLAY(«IF    FINISHED    ENTER    0,    ELSE    1')     REPLY(IPT); 
SWT    =    IPT:    end: 

/  *  A  ije^c  ajc  #  #  #  %■&  *  *#  *  £  *  A  *  *  afc  *  :£  s}c  A  A  *  *  *  5fe  *3je  s)c  £  *&  A*  A &  *  **  *  *  £  **  &  A  * 

*  ROUTINE    USED    T0    WRITE    THE    APRAY    BACK    INTO  * 

*  THE    COOES    FILP.  * 

COMMAND  =  'ERASE*  : 
CALL  IHEFILE(FCB) : 
COMMAND    =     'WRBUF' ; 

DO    I    =    1    TO    TBLSI7E : 

CARONUM    =    I  ; 

IP    TPC    THEN    DISPLAY(  «CODE(  '  I  I  I  I  I  ')        'MCODE(I)): 

CARD    BUFFER    =   CODE(I): 

CALL     IH=FILE(FCB) :    END? 
END    TABLE: 
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* 

*  **  ** 

*  **  HASHX    SURROUTINc  ** 

* 

MACRO 
T.Ml       HAKFY 

*  MACRO  USED    TO    CCNSTRUCT    THE    HASH    COPE    DISPLACEMENT 
6N1       LM  At5»TPMP+2^    LOAD    THE    COLLISION    CONSTANTS 

LM  6,7,0(11)  ESTABLISH    HASH    CODE    IN^EX 

XP  6,7  REDUCE    THE    KEY    TO    ONE    WORD 

SRDA  6,32 

D  6,TEMP+12         HASH    CODE     INDEX 

LR  7,6 

MR  6,2  HASH    TABLE    DISPLACEMENT 

C  7,=c»0' 

BNL  *  +  6 

LCR  7,7  LOAD    COMPLFMENT    IF    NEGATIVE 

LR  12, 7 

AP  12,15  ESTABLTSH    H&SM    TftPLE    FNTPY    POTNT 

MEND 
* 

MACRO 

£N2       KEYCK  CBP1 

*  MACRO  USED    T0    COMPARE    THE    IMPUT    WITH    THE    STnpED    ITPM 
SN2       LM  8,10,0(11)       LOAD    THE    CIRST    THR"    WORDS    OF    DATA 

CL  8,0(12)  CHECK    THE    CIRST    WORH    ^QR    MATCH 

BNE  T.BP1  BRANCH    IF    NOT    SAME 

CL  9,4(12)  CHCCK    THE    SECOND    WHPD 

BNE  £BP1  BRANCH    IF    NOT    *AME 

CL  10,8(12)  COMPARE    THE    THIRD    W^RD    C0R    m/vtch 

BNE  F.BP1  BRANCH    IF    NOT    SAME 

MEND 

MAC1?0 

EN3      HCOLL  SBP2,SBP3 

*  MACRO  USED    TO    COMPUTE    THE    COLLISION    DI*  DL ACPMENt 

r.N3       AR  4,2  ADD    DATA    SI7E    T0    LOWPR    COLL    CONST 

CR  4,5  CHECK    TO    SEE    IF    WITHIN    t*PLF    AREA 

BNL  F,8°3  IF    N0T    GO    TO    TABLE    FULL    ROIFIME     ,, 

BXLE  7, 4,*+ 6 

SP  7,5 

LR  12,7 

AR  12,15  RECOMPUTE    NEW    HASH    TABLE    ENTRY 

B  8BP2 

MEND 
# 

MACRO 

SN4      MVCHA  6R1,6R2,CBP1 

*  MACRO  USED    TO    MOVE    THE    DATA    PROM    INPUT    Tn    HA^H    TABLE 

*  OR    FROM    HASH    TABLE    TO    OUTPUT 

6N4       AR  2,1  REDUCE    NUMBER    hf    CHAR     BY    ONE 

EX  2,*+8  MODIFY    THE    NUMBFR    HP    CHA&     MOVED 

B  *  +  10 

MVC  0< 1,6R1.) ,0(SR2.) 

SR  2,1  RESTORE    THE    NUMBER    OF    CHARACTERS 

AR  11,2 

BCT  3,F,BP1       CHECK  TO  SCE  Ic  ALL  DATA  PPOCFSSED 

B  EXIT         IF  FINISHED  BRANCH  TO  EXIT  ROUTINE 

MEND 
* 

MACRH 
EN5       TPLFUL     T.NR 

5N5      L  1,TEMP+16 

L  2,=F«ENR.» 

ST  2,0(1) 
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B  FXIT 

MEND 

* 

*  AFTER     INITIALIZATION    THE    cOLLOWING    ARE    THE 

*  REGISTER    AS?tGNMFNts. 

*  R-0    CONTAINS    A    COMSTANT    BLANK    FOR    COMPAPTCPNC 

*  R-l    CONTAINS    A    CONSTANT    mi  NU^    ON'P 

*  R-2    CONTAINS    THE    DATA    LENGTH    IN    BYTES 

*  R-3    CONTAINS    THF    NUMBER    OP    TTFMS    tO    BE    PROCESSED 

*  R-4    CONTAINS    A    CnLLISION    CONSTANT    ((P#-1)/2*P?) 

*  R-5    CONTAINS    A    COLLISION    CONSTANT       (P#*R2) 

*  R-U    CONTAINS    THE    CURRENT    OATA    A^DRFSS 

*  P-12    CONTAINS    THE    HA^H    CODE    DISPLACEMENT 

*  R-13    CONTAINS    th^    PROGRAM    BASE    ADDt?cSS 

*  P-14    CONTAINS    THE    DATA    RASE    ADD&FSS 

*  R-15    CONTAINS    THP    HASH    TARLE    RAcE    ADDRESS 

*  R-6    TO    R-10    ARE     WORKING    REGISTUPFS 

HASHX  CSECT 

USING    *,15 

B  12(0,15)  BRANCH    AROUND    IDENTIFIER 

DC  ALK6) 

DC  CL6«HASHX'        IDENTIFIER 

STM  14,12,12(13)     SAVE    REGISTURES 

LP  12,13 

CNOP       0,4 

BAL  13,*+76 

DROP       15 

USING    *,13 

DS  18F 

ST  13,8(12) 

ST  12,4(13) 

* 

*  BEGIN  PROGRAM 
* 

LM  14,15,0(1)  LOAD  KEY  ADDRESS  AND  HASH  TABLE  BASF 

LM  2,6,8(1)  LOAD  PARAMETERS 

L  2,0(2)  LOAD  KEY  AMD  DATA  SI7P 

L  3,0(3)  LOAD  NUMBER  OF  KEYS  OR  ^ATA 

L  4,0(4)  LOAD  TYPE  0^  HP ( RET  =  1 ,ST=0 , 0EL=-1 ) 

L  5,0(5)  LOAD  TABLE  SI7E  (PRIMP  NUMREP) 

CNOP  0,4 

B  *  +  36 

TEMP    DS  8F  STORAGE    SDACE    C0R    PARAMETERS 

STM  2, 6, TEMP 

SLA  2,2  KEY/DATA    ST7E    TIMcS    POUR 

ST  2,TFMp+20  TEMP    STORE    Pc    DATA    S I  7  "=    IN    RYTES 

LR  7,5  ESTABLISH    COLLISION    CONSTANTS    Tn    * 

BCTR  5,0 

SRA  5,1  SUBTRACT    flNE    FROM    TABLE    S  I  7E 

MR  4,2  MULTIPLY    TABLE    SI7F    BY    DATA    SI7F 

SLA  7,2  MULTIPLY    TABLE    SI?E    BY    CPUR 

MR  6,2  MULTIPLY    4-TABLE    SI7E    BY    DATA    SI7E 

ST  5,TEMP+24  # —    TEMP    STORE    OF    CONSTANTS 

ST  7,TEMP+28 

LR  11,14 

L  1,=F'-1«  ESTABLISH    CONSTANT    MT  NUS     ONE 

L  0,=C«  '  ESTABLISH    CONSTANT    BLAN* 

L  6,=F'0«  ESTABLISH    7ER0    CONSTANT 

C  6,TEMP+8  CHE<~K    TO    SEE    1^    STOR*    hdprattoN 

BE  L0P1  BRANCH    TO    STORE    ROUTINE 

C  l,TFMP+8  CHECK    TO    SEE    Ic    DELET    OPERATtqn 

BE  OELT  BRANCH    TO    DELCTION    RDUTINC 

B  RTPV  BRANCH    TO    RETRIEVE    POUTIN^ 

* 

*  STORAGE    ROUTINE 

LHP1    HAKEY 

L0*>2    C  0,0(12)  CHECK    cqr    CMPTY  SLOT 

BE  *  +  16  IF    PMPTY    BRANCH  Tp    CHARACTER    MQVE 

C  1,0(12)  CHECK    TO    SEE    1=  ITEM    DELETED 
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PE  *+8  IF    DELETED    BR4NCH    TO    MHVE    ROUTINE 

B  FSLT  BRANCH    TO    FULL    SLOT    ROUTINE 

MVCHA  12,ll,LOPl 

FSLT    KEYCK  CLS2 

MVCHA  12,ll,LOPl 

CLS2    HCOLL  LOP2,Tf=Ll 
* 

*  RETRIEVE    ROUTINE 

PTRV    HAKEY 
LOP3    KEYCK    CLS1 

MVCHA     11,12»RTRV 
CLS1    HCHLL    L0P3,TFL2 
* 

*  DELETION  ROUTINE 
♦ 

DELT  HAKFY 

L0P4  KEYCK  CLS3 

ST  1,0(12) 

ST  1,4(12) 

BCT  3, DELT 

CLS3    HCOLL  L0P4,TFL2 
♦ 

*  TABLE    FULL    ROUTINF 
♦ 

TFL1  TRLFUL  1 
TFL2  TBLFUL  2 
* 

*  RETURN    ROUTINE 
* 

EXIT  L  13, A( 13) 

LM     14,12,12(13)  RESTORE  REGTSTIJRES 

MVI    12(13), X'PF'  SET  PFTURN  INDICATION 

LA     15,0(0,0)  LCAD  RETURN  CODE 

BR     14  RETURN 
END 
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**  DICTIONARY    CILE  ** 

*&  #* 

sjcsfssjcs^slesjcslcsicsjssic  A  ^t  j{;  :jt  5{c  j{:  ^c  A  j{;  ^t  jjs  ^  ^c  dt  ^c  ^c  ^£  j)c  *&  &sj:s(cs^  A*  # ***&*#&***  #  A  $  / 

(  1<MSPLR<S<*<PLUP>»  ME  SPLR<E<S<*<PLUP>»>  M|_Y<L<Y<*<-Anj  d 
ADVBP»»  MION<KO<N<*<-VPRBP  NOUNP»>»  MI  MR<KN<r,<*<  DPS  PT 
»>»  MAL<A<L<*<-NOUNP  ADJP»»  MED<p<n<*<o  A$T»>>  MFST<e<S 
<T<*<SUPPL>»»  MER<E<R<*<CPMPP»»  MIC<I<C<*<-NPIJNP  Anjo>> 
»MMENT<M<E<N<T<*<-VFRRP  MOUNP»»»  M£  RLE<A< R<l.<E<*<-V E" BP 
ADJP>»»>    MICS<KC<*<-NGUNP    ADJP>    S<*>>»    MO<rD<*<PAST»» 

E         <L<E<C<T<R<NOUNP    0<N</^ICS>    D<E<*»    L<Y<S<XS<*»»»    I 
<C<KT<Y<*>»    /M4L>>>>>»> 

T         <0<*<PRFPP    ADVRP>>    H<F<*<ADJP    DECART»    A<T<*<PPNQUNP 

ADJP>>>>    E<L<c<M<F<T<P<Y<*<NniJNP>»>>>»» 
A  <*<ADJP     INDAPT>    N<*<ADJD    TNOAPT>->    |_<L<*<AOJP>>>    C<n<lKS 

<NOUMP    T</MICS»>>    0<U<T  <p<VFRR^    E<*    /MD>    /MTNG»»>     B< 

0<U<T<*<PRFPD»»>    M<*<VEPRD>>> 
0  <B<T<A<KM<VFPBP    *    /MING>»>>    PPFPP    N<*>    F<*>    U<T<*< 

-PREPP    ADVBP>»> 
T  <*<PRNOUN>    N<*<PRcdp>    P<NniJNP    f<*    R<v<*<-MHUNP    VCRRD>     A 

<T</MinN»»»    T<c<R<p<s<T<VERBP    *    S<*>    /MED>»»»    S<* 

<VERBP>>> 
N         <A<T<U<P<E<*<NHUNP>>    /MAL»>>> 
S  <0<M<E<*<ADVRD>»>    E<N<0<VEPBP    *    /M!NG»» 

R  <E<F<E<R<VEP«P    *    R</MING    /MEn»>>    P<0<R<T<NCUJNP    * 

/MSPLP>>>»> 
H         <A<VERBP    S<*<PR?SG>>    D<*<PAST»    V<E<*<PRN3SG»    /MING» 

P<L<D<KN<G<NOUNP    *    /^SPl_P>>»>>> 
L  <A<N<G<U<A<G<E<NCUNP    *    /MSPLR»>>»>    I<K<F<*<'VrpRP>» 

NOUNP    S<T<I<M<G<*    /MSPl_P>»>>    N<G<U<KS<T<'/mic?»»»» 
D         <0<VERPP    *    /MTN|G>    F<S<T<R<0<Y<F<'R<NnU^P    *    /MSPLP»>»» 

»  U 

C         <A<L<C<U<L<U<S<*<NOUWP»»»»    0<U<P<L<E<P<NPUNP   * 

/MSPLR>>»>    M<p<U<T<F<R<NOUNP    *    /WSPLP>»»>» 
F         <KN<D<VERBP    *    /MTNG»» 
M         <E<*<PRNOUNP>» 
W         <A<N<T<*<VERBP>>>>    0<U<L<D<*<VFRBP»»>    T<T<H<*<PREPP» 

>>    H<KC<H<*<ADJP»»    A<T<*<TNT«=p    ADVRP»>» 
Y         <0<U<PRNOUNP    *    R<*<POSSP>»» 

G         <KV<VEPBP    E<*>    /MTNG»    A<V<E<VPopP    PA^T>»> 
P         <L<E<A<S<E<*<VCPRP>>>»» 
B         <KB<L<KO<G<R<A<P<H<NOUNP    Y<*>    KC</MAL>»>»»»»> 

V      K       X       J      0       Z 

<*<PUNTP» 
,  <*<PUNTC» 

?         <*<PUNTT»)J 
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